CDS

Accession Number TCMCG052C05329
gbkey CDS
Protein Id CAB4286609.1
Location complement(join(8221411..8221791,8221869..8221999,8222428..8222498,8222591..8222668,8223333..8223423,8223633..8223709,8224558..8224646,8224732..8224862,8224948..8225077,8226409..8226433,8227092..8227122,8227199..8227583))
Organism Prunus armeniaca
locus_tag CURHAP_LOCUS44111

Protein

Length 539aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJEB37669, BioSample:SAMEA6812185
db_source embl accession CAEKDK010000007.1
Definition unnamed protein product [Prunus armeniaca]
Locus_tag CURHAP_LOCUS44111

EGGNOG-MAPPER Annotation

COG_category K
Description Transcription initiation factor
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko03021        [VIEW IN KEGG]
KEGG_ko ko:K03124        [VIEW IN KEGG]
EC -
KEGG_Pathway ko03022        [VIEW IN KEGG]
ko05169        [VIEW IN KEGG]
ko05203        [VIEW IN KEGG]
map03022        [VIEW IN KEGG]
map05169        [VIEW IN KEGG]
map05203        [VIEW IN KEGG]
GOs GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005634        [VIEW IN EMBL-EBI]
GO:0043226        [VIEW IN EMBL-EBI]
GO:0043227        [VIEW IN EMBL-EBI]
GO:0043229        [VIEW IN EMBL-EBI]
GO:0043231        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGTCCGATGCGTATTGCTCCGACTGCAAGCGCCAGACGGAGGTGGTTTTCGACCACTCGGCGGGAGACACGGTGTGCTCCGAGTGTGGATTAGTGTTGGAATCTCATTCCATCGACGAGACCTCAGAGTGGCGTACCTTCGCGAACGAGTCCGGCGATAACGACCCGGTCCGTGTTGGTGGGCCCACCAACCCACTATTGGCCGATGGCGGGTTGTCCACCGTCATAGCCAAGCCTAATGGGGCATCTGGTGAGTTCTTATCTTCGTCGTTGGGGCGGTGGCAGAACCGTGGGTCGAATCCGGATCGGGGCTTGATTCTGGCCTTCAAAACCATCGCCACCATGTCTGATAGGTATGTGCTTGTTGCTGCGATATTGTGGGAAGGGTTGAAGTTGGATGAGAAAGATGATGATTGGTTGGGCCTTGTTGCAACCATCAAGGATCGGGCTAATGAGATATATAAGAGGGTGGAAGATCAGAAATCTAGTAGAGGAAGAAATCAGGACGCATTATTGGCTGCTTGCCTATATATTGCTTGTCGACAAGAAGACAAGCCACGCACTGTAAAGGAAATTTGCTCCGTCGCCAATGGAGCCTCAAAGAAGGAAATTGGCCGAGCAAAAGAATATATAGTGAAACAACTGGGACTAGAGAAGGGTCAGTCAGTGGAGATGGGAACAATACATGCCGGGGACTTTATGAGGCGCTTCTGTTCCAATCTTGGAATGAATAATCAAGCAGTCAAAGCTGCCCAAGAAGCTGTGCAAAAGTCGGAAGAATTTGATATAAGGAGGAGCCCTATATCGATAGCAGCAGCAGTTATTTATATTATCACTCAGCTTTCAGACGATAAGAAGCCTCTCAAAGATATCTCAGTTGCTACAGGAGTAGCAGAAGGAACAATCAGAAACTCATACAAAGACCTTTATCCCCATGTATCAAAGATAATACCCAACTGTCAAAATTCTTCTTCTCTTACTTCGATTCGAGTTGAATCATTTCTCGATGAATTGAAAGTAGTCACTGGATGCTCGTGTTTGGCCCTGGTAGGAGGGGGTCCAAGCCCCATGTTGCACTTCCAGCGGGCTTCAGCAGCTCGGGCAGGAGATCGGGCCACTCGTATGGAAATGGCCTTGGCTAGTAGGTGGAAAATTTTGAATAAAGAGTTAGGGAAATGGAGAGATGCGTTGACAAAAGCGAGGGATAACATTCAAAGAGGCCAAAATCTTGGCAACGAGCACAAATGTGGCTCGGTGCCATTAGTTAGGCAAAAAATCTTCAACAATCACCAATGTTGGGAGGTGATGAAAAATTGTTCAAGATTCAAAATTATTCCCACGGGTCCGACCGTTGTGTTGAATGAGACGTCGCTCCACAATTCAACGACATTGGATTCGCCTTTGGATTTCTCAATGAATCAAGACTCACCAATCCAAAAGGAGCCAAGGCCTATTGGGAGAAAGGCGGCGAAGGCCAACAGAGGGAGTAAGTCCACCAATGATACTGCAAAAATTTTGGAGCAAATTGCTCTAAACGGCACAATGAGAATTGAGAGAGACATGAAAAGAGATTTGGATGAGAGGACAATGTATGAAGAATTTGCAAGAGAAAGGGAGTAA
Protein:  
MSDAYCSDCKRQTEVVFDHSAGDTVCSECGLVLESHSIDETSEWRTFANESGDNDPVRVGGPTNPLLADGGLSTVIAKPNGASGEFLSSSLGRWQNRGSNPDRGLILAFKTIATMSDRYVLVAAILWEGLKLDEKDDDWLGLVATIKDRANEIYKRVEDQKSSRGRNQDALLAACLYIACRQEDKPRTVKEICSVANGASKKEIGRAKEYIVKQLGLEKGQSVEMGTIHAGDFMRRFCSNLGMNNQAVKAAQEAVQKSEEFDIRRSPISIAAAVIYIITQLSDDKKPLKDISVATGVAEGTIRNSYKDLYPHVSKIIPNCQNSSSLTSIRVESFLDELKVVTGCSCLALVGGGPSPMLHFQRASAARAGDRATRMEMALASRWKILNKELGKWRDALTKARDNIQRGQNLGNEHKCGSVPLVRQKIFNNHQCWEVMKNCSRFKIIPTGPTVVLNETSLHNSTTLDSPLDFSMNQDSPIQKEPRPIGRKAAKANRGSKSTNDTAKILEQIALNGTMRIERDMKRDLDERTMYEEFARERE